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ABSTRACT 


The emergence of drug resistant strains of Mycobacterium tuberculosis necessitates the discovery of new molecular scaffolds a priority, and the current 
situation even necessitates the re-engineering and repositioning of some old drug families to achieve effective control. Due to its essential nature and 
coupled with the absence of a human homolog, an essential and uniquely prokaryotic enzyme alanine racemase has been pursued as a target for 
antimycobacterial drug discovery. Biological assembly of obligatory dimer functional unit is modeled. D-Cycloserine is a rigid analog of D-alanine is 
chosen as scaffold for the rational design of new analogs. In this study, we present a unified approach involving homology modeling and molecular 
docking studies. Collectively, the results demonstrate the feasibility of using homology modeling and molecular docking studies to obtain novel alanine 
racemase inhibitor lead compounds that are potentially useful for development as antimycobacterial agents. It should be noted that this computational 
predicted data should be validated using suitable assays for further consideration. 
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Abbreviations: PDB - Protein Data Bank, kDa-Kilodalton, DOTS - Directly Observed Therapy Short Course, RMS-Root Mean Square, ADME/T - 
Absorption, Distribution, Metabolism, Excretion and Toxicity 


1. INTRODUCTION 


Once thought to be on the decline, Tuberculosis (TB) still 
remains a major global health problem. It ranks as the 
second leading cause of death from an infectious disease 
worldwide, after the human immunodeficiency virus (HIV). 
TB is an infectious disease caused by the bacillus 
Mycobacterium tuberculosis (Mtb). It typically affects the 
lungs (pulmonary TB) but can affect other sites as well 
(extrapulmonary TB) (WHO 2012). The global burden of TB 
remains enormous. In 2011, there were an estimated 8.7 
million new cases of TB (13% co-infected with HIV) and 1.4 
million people died from TB (WHO 2012). This situation is 
further exacerbated by the emergence of multi-drug resistant 
form of TB (MDR-TB) and extensively drug-resistant TB 
(XDR-TB). 


Tuberculosis 

1.Once thought to be on the decline, tuberculosis still remains a major global health 
problem today 

2.The emergence of multidrug resistant strains and persistence nature of 
Mycobacterium tuberculosis has caused stringent need to search novel drug 
targets 

3.Non-homologous proteins of metabolic pathways are first preference for effective 
drug designing to avoid the deceptive targeting and side-effects in host parasite 
diseases 

4.Due to its essential nature, coupled with the absence of a human homolog, an 
essential and uniquely prokaryotic enzyme alanine racemase has long been an 
attractive drug target for antimicrobial drug discovery. 

5.The developed model showed a good overall structural quality and stereochemical 
parameters compliance as confirmed by different validation tools 

6.Lead molecules were generated and prioritized to follow Lipinski’s rule-of-five 
based on the drug likeliness properties 

7.The scheme of the study has designed and identified several novel non-substrate 
alanine racemase inhibitors 


Today's TB drug regimen takes too long to cure, and is 
too complicated to comply with. Despite the flaws with and 
growing resistance to current TB treatments, no new TB 
drugs have been developed in nearly 50 years (Tomioka 
and Namba 2006; TBAlliance 2012). The poor efficiency of 
identifying new TB drugs by screening pharmaceutical 
library collections has been linked to the limited chemical 
diversity within these collections (Payne, Gwynn et al. 2007). 
In addition, many new antibiotic candidates are chemical 
molecules re-engineered from old drug classes discovered 
decades ago. This approach has identified new TB drugs 
from existing antibacterial drug classes and either involved 
the redesign of accessible scaffolds to improve their 
antimycobacterial potencies or more directly, the 
repositioning of known antibacterial drugs with good 
antimycobacterial activity (Koul, Arnoult et al. 2011). During 
re-engineering of known scaffolds, chemical modifications 
are introduced into the core structure that may lead to 
improved bactericidal activities, better resistance profiles, 
safety, tolerability or superior © pharmacokinetic/ 
pharmacodynamic properties (Koul, Arnoult et al. 2011). 

The bacterial cell wall is an ideal target for drug design 
since similar structures and biosynthetic pathways are 
absent from mammalian hosts (Feng and Barletta 2003). D- 
alanine is one of the central molecules of the cross-linking 
step of peptidoglycan assembly. In an effort to discover new 
drugs to treat TB we chose Alanine racemase (Alr) as the 
target of our drug discovery efforts. In Mtb, alanine 
racemase (EC 5.1.1.1), a pyridoxal 5’-phosphate (PLP)- 
containing enzyme catalyzes the racemization of L-alanine 
to D-alanine, which is an essential precursor for the 
pentapeptide cross-bridge of the peptidoglycan layer in 
bacterial cell wall (Anthony, Strych et al. 2011). Alr was 
considered for drug designing due its role in cell wall 
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Homology modeling: 
Predicting the tertiary 
structure of an unknown 
based on the known 
coordinates of a protein to 
which it is homologous i.e., 
has a high degree of 
sequence identity/similarity 


Dimer: 

Dimer is a macromolecular 
complex formed by two, 
usually non-covalently 
bound, macromolecules 
like proteins. It is a 
quaternary structure of a 
protein. A homo-dimer 
would be formed by two 
identical molecules and a 
hetero-dimer would be 
formed by two different 
macromolecules 


Docking: 

Docking is an energy- 
based operation for 
exploring the binding 
modes of two interaction 
molecules. Docking is 
frequently used to predict 
the binding orientation of 
small molecule drug 
candidates to their protein 
targets in order to in turn 
predict the affinity and 
activity of the small 
molecule. 
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synthesis, cell wall organization, alanine metabolic process, 
alanine racemase activity, and pyridoxal phosphate binding. 
This enzyme is ubiquitous among prokaryotes, and with very 
few exceptions are absent in eukaryotes, making them a 
logical target for the development of novel antibiotics 
(Strych, Penland et al. 2001). The active form of the enzyme 
is an obligatory dimer containing two monomers of 43 kDa in 
head-to-tail orientation. Residues from both monomers 
contribute to the two active sites, where PLP and alanine 
bind (LeMagueres, Im et al. 2005; Anthony, Strych et al. 
2011). Due to its essential nature, coupled with the absence 
of a human homolog, an essential and uniquely prokaryotic 
enzyme Alr has long been an attractive drug target and has 
long been pursued as a target for antimicrobial drug 
discovery. 

The emergence of emergence of MDR-TB and XDR-TB 
makes the discovery of new molecular scaffolds a priority, 
and the current situation even necessitates the re- 
engineering and repositioning of some old drug families to 
achieve effective control (Koul, Arnoult et al. 2011). 
Considering the urgency of the situation, drug discovery 
strategy for the incremental improvements of existing 
scaffolds is to fill a drug development pipeline is chosen for 
the purpose of this study. D-Cycloserine (DCS; 4-amino-3- 
isoxazolidinone) is a rigid analog of D-alanine. It interferes 
with an early step in bacterial cell wall synthesis in the 
cytoplasm by competitive inhibition of two enzymes, L- 
alanine racemase, which forms D-alanine from L-alanine, 
and D-alanylalanine synthetase, which incorporates D- 
alanine into the pentapeptide necessary for peptidoglycan 
formation and bacterial cell wall synthesis (Knox, Law et al. 
2011; DrugBank 2012). DCS is effective against 
mycobacteria and is indicated in the treatment of multidrug- 
resistant active pulmonary and extra-pulmonary TB in the 
DOTS-Plus management plan. The potent bactericidal effect 
of DCS against mycobacteria makes this drug an attractive 
prototype compound to develop novel antimycobacterial 
agents (Feng and Barletta 2003). In addition, identification of 
the lethal target(s) of DCS action would allow for the rational 
design of new antimycobacterial drugs, structurally related 
or unrelated to DCS, targeting enzymes of the D-alanine 
pathway of peptidoglycan biosynthesis (Feng and Barletta 
2003). Moreover, this type of inhibitors may weaken the cell 
wall and act synergistically with other antimicrobial agents 
(Feng and Barletta 2003). 


2. OBJECTIVE AND SCOPE OF THE 
STUDY 


Advances in the identification of new TB drug targets have 
been driven largely by the availability of the genome 
sequence of Mtb (Koul, Arnoult et al. 2011). The main 
objective of the present research is to develop a unified 
approach involving homology modeling and molecular 
docking studies on Mtb-Alr. We hope the present work forms 
the basis for further molecular modeling and biochemical 
studies on targeting the Mtb-Alr enzyme for therapy, and 
conclusively reveal a potent lead compound based on best 
values of docking energy. 


3. MATERIALS AND METHODOLOGY 


The various computational software programs have been 
appropriately referred in the framework of the study and are 
mostly based on Windows or Linux operating system. The 
methodology pursued is sequentially represented in the 
research scheme below. 


3.1. Sequence Analysis 

The amino acid sequence of Mtb-Alr in FASTA format was 
retrieved from NCBI (NCBIGenBank: CAB01033.1; Cole, 
Brosch et al. 1998). The Rv3423c Gene Info, Mtb-Alr protein 
sequence and Mtb-Alr genomic of Mtb H37Rv were also 
retrieved from Tuberculosis Data Base for cross verification 
and validation (Reddy, Riley et al. 2009). Sequence 
alignment search was done using BLASTP program (version 
BLASTP. 2.2.26) at NCBI (Altschul, Madden et al. 1997). 
After choosing template with highest similarity to Mtb-Alr 
sequence, pairwise sequence alignment was performed 
using programme EMBOSS_ Needle available at 
http:/Awww.ebi.ac.uk/Tools/psa/. Multiple sequence 


alignment is performed using the program ClustalW (version 
2) through web form available at 
http://www. ebi.ac.uk/Tools/msa/ClustalW2/ (Thompson, 
Gibson et al. 2002; Chenna, Sugawara et al. 2003; Larkin 
MA 2007; Goujon M 2010). 


3.2. Homology Modeling 

MODELLER is used for homology or comparative modeling 
of protein three-dimensional structures (Marti-Renom, Stuart 
et al. 2000; Fiser and Sali 2003; Eswar, Webb et al. 2006; 
Eswar, Webb et al. 2007). The three dimensional model of 
Mtb-Alr was built using the crystal structure coordinates of 
PDB ID: 1XFC A-chain. All steps of homology modeling 
were performed using MODELLER 9 v10 software. Atom 
files with coordinates for the template structures, the 
alignment file with the alignment of the template structures 
with the target sequence, and MODELLER commands in a 
script file were generated and run through MODELLER 
9v10. Nearly 100 runs of model building were carried out in 
order to obtain the most reasonable homology model of Mtb- 
Alr and a preliminary structural investigation was done on 
the graphic screen to scrutinize the reliability of the 
alignment. Structural refinement of the developed model, 
visualization, superpositions and alignments was performed 
using UCSF Chimera version 1.6.1 (Pettersen, Goddard et 
al. 2004; Goddard, Huang et al. 2005; Meng, Pettersen et al. 
2006; Goddard, Huang et al. 2007; Yang, Lasker et al. 
2011). 


3.3. Validation of the Generated 
Homology Model 


The built model of Mtb-Alr was assessed using protein 
structure and model assessment tools available at 
SwissModel Workspace over http://swissmodel.expasy.org 
(Schwede, Kopp et al. 2003; Arnold, Bordoli et al. 2006; 
Bordoli, Kiefer et al. 2009; Bordoli and Schwede 2012). 
QMEANE6 score is used to evaluate the generated Mtb-Alr 
homology model (Benkert, Tosatto et al. 2008; Benkert, 
Kunzli et al. 2009; Benkert, Tosatto et al. 2009). The atomic 
empirical mean force potential ANOLEA (Atomic Non-Local 
Environment Assessment) is used to assess packing quality 
of the model (Melo, Devos et al. 1997; Melo and Feytmans, 
1998). GROMOS is a general-purpose molecular dynamics 
computer simulation package is used for the analysis of 
conformations (Christen, Hunenberger et al. 2005). The 
PROCHECK v.3.5 suite of programs is used to assess the 
"stereochemical quality" of the built Mtb-Alr homology model 
available at http://www. ebi.ac.uk/thornton- 
srv/software/PROCHECK/ (Laskowski 1993; Laskowski, 
Rullmannn et al. 1996). A set of WHAT IF checks on the 
Mtb-Alr model was performed using servers at 
http://swift.cmbi.ru.nl/servers/html/index.html (Vriend, 1990; 
Hooft, 1996). A set of ProSA checks on Mtb-Alr homology 
model were performed using’ server located at 
https://prosa.services.came.sbg.ac.at/prosa.php (Sippl, 
1993; Wiederstein and Sippl 2007). 


3.4. The Secondary Structure Analysis 
The secondary structure analysis of Mtb-Alr model was 
carried out through PDBsum_ online’ server at 
http://www.ebi.ac.uk/pdbsum/ (Laskowski, Hutchinson et al. 
1997; Laskowski 2001; Laskowski, Chistyakov et al. 2005; 
Laskowski 2007; Laskowski 2009). 


3.5. Domain Analysis and Ligand 
Binding Site Analysis 

For classifying the Mtb-Alr model sequence at superfamily, 
family and subfamily levels, and predicting the occurrence of 
functional domains a sequence search of InterProScan 
(Version 4.8), via  InterProScan web server at 
http://www. ebi.ac.uk/Tools/pfa/iprscan/ is performed 
(Zdobnov and Apweiler 2001; Quevillon, Silventoinen et al. 
2005; Mulder and Apweiler 2007; Kelly, Vincent et al. 2010). 
For cross verification a Scansite (Ver 2.0) of Mtb-Alr model 
sequence for Motifs was performed at Scansite web server 
over http://scansite3.mit.edu/#home (Obenauer, Cantley et 
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Table 1 QMEANG score of the bui 


It Mtb-Alr homology model 


C-beta interaction energy* -132.48 -0.04 
All-atom pairwise energy? -9339.65 -0.05 
Solvation energy® -40.39 0.37 
Torsion angle energy? 122.09 0.88 
Secondary structure agreement® 80.6% -0.11 
Solvent accessibility agreement! 81.1% 0.04 
QMEANG score 0.785 0.17 


* Residue-level, secondary structure specific interaction potential using CB atoms as interaction centers. 


Range 3...25 A, step size: 1A 


PAll-atom, secondary structure specific interaction potential using all 167 atom types. Range 3...20 A, 


step size: 0.5A 


° Potential reflecting the propensity of a certain amino acid for a certain degree of solvent exposure 
approximated by the number of CB atoms within a sphere of 9 A around the centre CB. 
* Extended torsion potential over 3 consecutive residues. Bin sizes: 45 degree for the center residue, 90 


degree for the 2 adjacent residues 


° Agreement between the predicted secondary structure of the target sequence and the calculated 


secondary structure of the model. 


' Agreement between the predicted relative solvent accessibility using ACCpro (buried/exposed) and the 


relative solvent accessibility derived from DSSP (>25% accessibility => exposed) 


al. 2003). The prediction of ligand-binding sites was done 
using 


Figure 1 


3DLigandSite is a web server at 


Figure 2 


The built 3-D model of Mtb-Alr showing 


different secondary structure 


conformations. The helix is represented in 
orange, the strands in blue, and loops in 


yellow. 


Ramachandran plot 


All Ramachandrans: 5 
Chil-chi2 plots: 2 
Main-chain params: 6 
Side-chain params: 5 


Residue properties: Max. 


Bond len/angle: 5.0 


G-factors 


M/c bond lengths: 99.6% 


94.1% 
100.0% 


M/c bond angles: 
Planar groups: 


+ May be worth investigating further. 


Figure 3 


.8% core 


Dihedrals: 0.07 


Superposition of crystal structure of 
template 1XFC-A with built 3-D model of 
Mtb-Alr. The backbone of 1XFC-A model is 
in orange and the built 3-D model of Mtb-Alr 
is in blue 


371 residues 


4.5% allow 0.6% gener 0.0% disall 
labelled residues (out of 369) 

labelled residues (out of 186) 

better 0 inside 0 worse 

better 0 inside 0 worse 

18.2 


deviation: Bad contacts: 


Morris et al class: 1 
Covalent: -0.13 Overall: 


within limits 
within limits 


0.4% highlighted 
5.9% highlighted 


within limits 0.0% highlighted 


* Worth investigating further. 


Summary of PROCHECK analysis 
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http://www.sbg.bio.ic.ac.uk/3dligandsite (Wass, Kelley et 
al. 2010). 


3.6. NESTS Analysis 


Nests analysis is done over ProFunc server available at 
http://www. ebi.ac.uk/thornton-srv/databases/ProFunc (Pal, 
Suhnel et al. 2002; Watson and Milner-White 2002). 


3.7. Biological Assembly of 
Obligatory Dimer Functional Unit 


Structure 

The active form of the Alr enzyme is an obligatory dimer 
containing two monomers of 43 kDa in head-to-tail 
orientation. Residues from both monomers contribute to 
the two active sites. Biological assembly of obligatory 
dimer functional unit from monomer of built homology 
model is performed using SymmDock Web Server located 
at http://bioinfo3d.cs.tau.ac.il/SymmDock (Dina 
Schneidman-Duhovny, Yuval Inbar et al. 2005; 
Schneidman-Duhovny, Inbar et al. 2005). 


3.8. Generating Ligands and 


Optimization 

D-Cycloserine (DCS; 4-amino-3-isoxazolidinone) is a 
rigid analog of D-alanine is chosen as scaffold for the 
rational design of new  D-cycloserine analog 
antimycobacterial drugs, structurally related to DCS, and 
targeting enzymes of the D-alanine pathway of 
peptidoglycan biosynthesis. In this scenario of designing 
ligands, all the modifications were done by taking in to 
consideration of a database of substituents and spacers 
(linkers) obtained by substructure analysis of a collection 
of current drugs and development drugs. 100 lead 
molecules were designed with structural modifications of 
parent molecule using Molinspiration server located at 
http://www. molinspiration.com /services/search. html. All 
these molecules were then analyzed for their ability to 
follow Lipinski’s Rule-of-five by subjecting them to 
Molinspiration server at 
http://www. molinspiration.com/cgi-bin/properties. The 25 
high ranked lead molecules were prioritized to follow 
Lipinski’s rule-of-five based on the drug _likeliness 
properties. 


3.9. Protein-ligand Docking 

The docking of ligands to the active site of Mtb-Alr model 
was performed using AutoDock Vina software. AutoDock 
Vina (version is 1.1.2) is a new open-source program for 
drug discovery, molecular docking and virtual screening, 
offering multi-core capability, high performance and 
enhanced accuracy and ease of use (Trott and Olson 
2010). AutoDock Vina automatically calculates the grid 
maps and clusters the results in a way transparent to the 
user. Docking was performed to obtain a population of 
possible conformations and orientations for the ligand at 
the binding site. Using the software, polar hydrogen 
atoms were added to the Mtb-Alr protein and its nonpolar 
hydrogen atoms were merged. All bonds of ligands were 
set to be rotatable. All calculations for protein-fixed 
ligand-flexible docking were done using the Lamarckian 
Genetic Algorithm (LGA) method. The grid box with a 
dimension of 20 x 20 x 20 points was used around the 
catalytic triad to cover the entire enzyme binding site and 
accommodate ligands to move freely. The 25 high 
ranked DCS lead molecules were chosen for docking 
simulations on to Mtb-Alr model. 


4. RESULTS AND DISCUSSION 


The amino acid sequence of Mtb-Alr obtained from NCBI 
comprising of 408 amino acids and bearing a gi. 
No1449364 (NCBIGenBank: CAB01033.1; Cole, Brosch 
et al. 1998). The BLAST-P tool was performed by 
selecting database as PDB alone, and has resulted in a 
total of 56 blast hits over the query sequence with varied 
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Table 2 Results of WHAT IF checks of Mtb 
1st generation packing quality -0.615 
Ramachandran plot appearance 0.939 
chi-1/chi-2 rotamer normality -0.573 
Backbone conformation -0.770 
Bond lengths 0.960 
Bond angles 1.198 
Omega angle restraints 0.664 
Side chain planarity 0.300 
Improper dihedral distribution 1.051 
Inside/Outside distribution 0.946 


S 
7 
P= 
ob 
<7] 
= 


Figure 4 


Ramachandran plot for Mtb-Alr model from PROCHECK analvsis 
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alignment values. Specific hits were from Type III Pyridoxal 
5-phosphate (PLP)-dependent enzyme alanine racemase 
super family. These are proteins of fold type Ill PLP- 
dependent enzymes that play essential roles in the 
interconversion between L- and D-alanine, which is an 
essential component of the peptidoglycan layer of bacterial 
cell walls. In this case, PDB-ID: 1XFC-A, PDB-ID: 1VFH-A 
and PDB-ID: 2DY3-A were three high-scoring database 
matches that align to most of the query sequence. 1XFC-A, 
1VFH-A and 2DY3-A showed a query coverage of 94%, 
89% and 88% respectively. Among the obtained BLAST P 
results, the chain A crystal structure of alanine racemase at 
1.9 A resolution with PDB ID: 1XFC-A was identified to 
possess highest similarity to Mtb-Alr, and hence it was 
chosen as a template. 

The quality of Pairwise Sequence Alignment between 
the target and template sequences is the most important 
factor determining the accuracy of the homology model. The 
pairwise sequence alignment has shown the overall identity 
as 384/408 (94.1%), similarity: 384/408 (94.1%), gaps: 
24/408 (5.9%) and score: 1961.0 inferring a good homology. 
There were few gaps and variations in alignment of 
sequences which correspond to the sequences at the loops 
of structures. Multiple sequence alignment is also an 
important step for phylogenetic analysis, which aims to 
model the substitutions that have occurred over evolution 
and derive the evolutionary relationships between 
sequences. The outcome of multiple sequence alignment is 
presented a close evolutionary relationship with PDB ID: 
1XFC-A, 1VFH-A and 2DY3-A. 

MODELLER is used for homology or comparative 
modeling is a fully automated and constructs energy 
minimized protein models by satisfaction of spatial restraints 


extracted from the template PDB file 1XFC-A. The built 3-D 
model of Mtb-Alr showing different secondary structure 
conformations is shown in figure 1. Superposition of crystal 
structure of 1XFC-A with built 3-D model of Mtb-Alr is shown 
in figure 2. The percent identity of built 3-D homology model 
of Mtb-Alr compared to 1XFC-A when divided by longer 
sequence length was 98.65% and while divided by shorter 
sequence length was 100%. Therefore the built 3-D 
homology model of Mtb-Alr can thus be characterized as a 
good theoretical model for further analysis. 

Evaluation of model quality is a crucial step in homology 
modeling (Rodriguez, Chinea et al. 1998; Chivian and Baker 
2006; Gao, Xu et al. 2009). The assessment of protein 
structures is a delicate matter. Currently, there is not a 
single method able to consistently and accurately predict the 
three-dimensional structure of a protein. The results of 
QMEANGE score of the Mtb-Alr model are shown in Table 1. 

QMEANG is a composite scoring function which is able 
to derive both global (i.e. for the entire structure) and local 
(i.e. per residue) error estimates on the basis of one single 
model. QMEANG is a reliability global score for the whole 
model reflecting the predicted model reliability ranging from 
0 to 1. The quality estimate ranges between 0 and 1 with 
higher values for better models. QMEANG6 score of the built 
Mtb-Alr model is 0.785 indicating optimal model reliability 
(Benkert, Tosatto et al. 2008; Benkert, Kunzli et al. 2009; 
Benkert, Tosatto et al. 2009). ANOLEA performs energy 
calculations on a protein chain, evaluating the "Non- Local 
Environment" (NLE) of each heavy atom in the molecule, 
and GROMOS computer simulation package studies the 
biomolecular systems and can be applied to the analysis of 
conformations obtained by experiment or by computer 
simulation. ANOLEA and GROMOS results noticeably 
indicate the predominance of the negative energy values for 
a given amino acid representing favorable energy 
environment, and thus indicating a favorable packing quality 
the models. 

PROCHECK is to assess how normal, or conversely 
how unusual, the geometry of the residues in a given protein 
structure is, as compared with stereochemical parameters 
derived from well-refined, high-resolution structures. 
PROCHECK was used to obtain Ramachandran plot for 
Mtb-Alr homology model to understand the distribution of 
@/w angles of amino acids. The results are shown in the 
figure 3 and figure 4. 94.8 % of the amino acids are in the 
most favored regions, 4.5 % are in the additional allowed 
regions, 0.6 % are in the generously allowed region and 
0.0% in the disallowed regions. Further the parameters of 
PROCHECK have given an insight of main-chain properties 
and five side chain parameters of Mtb-Alr model which has 
shown to contain no bad contacts and all the stereochemical 
parameters are in par with the standard values. 99.6% of 
main chain bond lengths, 94.1% main chain bond angles 
100.0% of planar groups are within limits. The overall G- 
value was observed to be within the permitted range with G- 
factors for dihedrals as 0.07, for covalent as -0.13 and 
overall is 0.00. All these results of PROCHECK 
demonstrated a positive indication of the reliability of Mtb-Alr 
homology model for docking studies. 

The results of WHAT IF checks on the Mtb-Alr model 
are shown in Table 2.The first part of the table shows a 
number of global quality indicators. The second part of the 
table mostly gives an impression of how well the model 
conforms to common refinement restraint values. The 
distribution of residue types over the inside and the outside 
of the protein is normal with RMS Z-score of 0.946. 
Structural average packing environment was satisfactory 
.The structural average packing score is-0.746 and is within 
normal ranges. No missing atoms were detected in residues 
and all expected atoms are present in residues. All 
parameters are within the range of templates, and can be 
inferred that the developed model is of good quality. The 
ProSA z-score indicates overall model quality and its values 
are displayed in a plot that contains the z-scores of all 
experimentally determined protein chains in current PDB. 
ProSA-web analysis of Mtb-Alr model is represented as 
figure 5, and has shown the Z-score value of -8.67 and is in 
the range of native conformation of similar size template 
PDB:1XFC-A which is-8.3. PDBsum secondary structure 
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Figure 5 


400 600 800 
Number of residues 


400 600 800 
Number of residues 


ProSA-web analysis:Z-score plot of Mtb-Alr Model (left) and Z-score plot of PDB:1XFC-A (Right) 
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Figure 6 
Analysis of PDBSum: Wiring diagram of Mtb-Alr Secondary structure 


summary indicated a total of 371 residues distributed as 88 
residues (23.7%) as strands, 113 residues (30.5%) as alpha 
helix, 19 residues (5.1%)as 3-10  helixand 151 
residues (40.7%) as other secondary structures. Wiring 
diagram of Mtb-Alr secondary structure is shown as figure 6. 

InterProScan has identified the built model sequence 
with Alanine racemase signature as evident by InterPro 
entry accession number IPRO00821. The Alr, N-terminal 
domain is identified at 6-228 residues by InterPro entry 
accession number IPR001608. The Alr, C-terminal domain 
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is identified at 240-368 residues by InterPro entry accession 
number IPRO11079. The = Alr, —pyridoxal-phosphate 
attachment site is identified at 29-39 residues by InterPro 
entry accession number IPRO20622. The PLP-binding barrel 
superfamily signature is identified at 7-240 residues by 
superfamily HMM library identifier SSFS1419. The results of 
Scansite were aligned with the InterProScan, and Alr, N- 
terminal domain is identified at 6-228 residues and Alr, C- 
terminal domain is_ identified at 240-368 residues. 
3DLigandSite utilizes protein-structure prediction to provide 
structural models for proteins that have not been solved. 
Ligands bound to structures similar to the query are 
superimposed onto the model and used to predict the 
binding site. The predicted binding site for Mtb-Alr homology 
model is of the residues Val 64, Lys 66, Tyr 70, Trp 112, His 
196, Tyr 199, Asn 236, Ser 237, Arg 252, Pro 253, Gly 254, 
lle 255 and Tyr 388. The predicted bindings site us shown in 
the figure 7. 

ProFunc is a web server for predicting the likely function 
of proteins whose 3D structure is known but whose function 
is not (Pal, Suhnel et al. 2002; Watson and Milner-White 
2002).Nests are structural motifs that are found in 
functionally important regions of protein structures. 16 nests 
were located in this built model chain with the residue 
conservation score of 1.0. The conservation score ranges 
from 0.0, signifying that the residue is not at all conserved, 
to 1.0, which indicates it is perfectly conserved. While five 
residue ranges demonstrated the nest score above 2.0. A 
nest score above 2.0 is suggestive of the nest being a 
functionally significant one. The five residues are residue 
ranges are Asp268-Gly270 (Asp268, Met269 Gly270) with 
nest score 4.0 , Pro368-Arg371 (Pro368, Gly369, Ile370, 
Arg371) with nest score 3.89, Gly71-Gly73 (Gly71, His72, 
Gly73) with nest score 2.0, Thr160-Asn165 (Thr160, Gly161, 
Leu162, Asn163, Arg164 , Asni65) with nest score 2.0 and 
Gly324-Leu326 (Gly324,Arg 325, Leu326) with nest score 
2.0. Collectively, the developed model showed a good 
overall structural quality and stereochemical parameters 
compliance as confirmed by different validation tools. 
SymmDock is an algorithm for prediction of complexes with 
Cn symmetry by geometry based docking. Given the 
structure of the asymmetric unit of the multimer complex, 
SymmDock predicts the structure of the entire complex. The 
output of the method is a list of complexes that fulfill the 
cyclic symmetry constraints. The symmetry constraints were 
cross verified with that of PDB ID-1XFC Biological assembly. 
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Figure 7 


Predicted binding site of Mtb-Alr 


Table 4 Results of multiple docking runs performed with AutoDock Vina over Mtb-Alr and DCS lead molecules 

DCS 1 -4.2 11.73 10.9 
DCS 2 -4.3 12.53 11.76 
DCS 3 -4.4 11.65 11.06 
DCS 4 -4.4 9.61 9.00 

DCS 5 -4.5 13.53 12.99 
DCS 6 -4.6 11.25 10.56 
DCS 7 -4.6 10.70 10.42 
DCS 8 -4.6 15.09 14.26 
DCS 9 -4.6 14.23 13.79 
DCS 10 -4.7 14.07 13.62 
DCS 11 -4.7 6.25 5.63 

DCS 12 -4.8 17.99 16.90 
DCS 13 4.9 12.90 12.44 
DCS 14 -5.1 8.64 8.21 

DCS 15 -5.3 7.42 5.87 
DCS 16 -5.5 16.59 15.95 
DCS 17 5.5 5.83 4.81 

DCS 19 -5.6 9.40 8.84 
DCS 22 -5.9 12.84 11.82 
DCS 21 -6.5 10.39 9.32 

DCS 25 -6.5 13.07 11.95 
DCS 20 7.3 10.59 9.38 
DCS 23 -7.7 8.71 7.88 
DCS 24 -7.8 7.74 6.34 
DCS 25 -9.1 7.26 6.60 

* Root Mean Square Deviation/Upper Bound 

» Root Mean Square Deviation/Lower Bound 


AutoDock: 
AutoDock is an 
automated 
procedure for 
predicting the 
interaction of 
ligands with 
biomacromolecular 
targets 
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Figure 8 


3-D Biological assembly of obligatory dimer 
functional unit represented as ribbon. Chain A is 
represented in violet color and chain B as red 
color. Secondary structure conformations are 
represented in the following color code: helix is 
represented in orange, the strands in blue, and 
loops in yellow 


Biological assembly of obligatory dimer functional unit is 
shown in figure 8. 

The 25 high ranked lead molecules were prioritized to 
follow Lipinski’s rule-of-five based on the drug likeliness 
properties is listed in table 3. Docking of small molecule 
compounds onto the binding site of a receptor and 
estimating the binding affinity of the complex is an important 
part of the structure based drug design process. A docking 
study usually starts with the definition of a binding site, in 
general a restricted region of the protein. The results of 
multiple docking runs are automatically analyzed and a 
ranked list of the docked poses is generated in terms of 
predicted binding affinity in kcal/mol. The results of multiple 
docking runs performed with AutoDock Vina over Mtb-Alr 
and DCS lead molecules is represented in Table 4. All the 
lead molecules have shown interactions with the Mtb-Alr 
active site amino acids determined earlier. These 
interactions may be due to the formation of H-bonds or by 
the establishment of vanderwalls forces. DCS analog 25 
(DCS 25) achieved a good convergence with the best- 
docked conformations having the lowest binding compared 
to other series of DCS inhibitor analogs with lowest binding 
energy -9.1 Kceal/mol. 
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Table 3 DCS lead molecules — 


Molecular 


(3R,4S,5S)-3-amino- 


roperties including Rule of 5 parameters and molecular dru 


-likeness 


CC2=CC=CC=C2 


3,5-dione 


Desi C6H11NO2 CCIC(OC(=0)C1N)C | 4,5-dimethyloxolan- -1.393 52.328 9 29.159 124.655 
2-one 
3- 
Dcs2 C5H8N202 CNCICC(=0)NC1=0 | (methylamino)pyrroli | -1.356 58.196 9 28.131 114.755 
dine-2,5-dione 
Dcs3 C4HEN202 CON1C=CNC1=0 S Mmethoxy IE -0.403 47.03 8 4.104 98.925 
imidazol-2-one 
a sy 4-(methylamino)-2H- 7 
pcs4 C5H7NO2 CNC1=CCOC1=0 ae eae 0.943 38.332 8 3.116 102.943 
bess C5H7NOS COC1=CSC=C1N ae | . eee 35.257 | 8 29.184 111.59 
2,4-diamino-1,4- 
DCS6 C3H6N40 C1(C(=O)NC(=N1)N)N- | dihydroimidazol-5- -2.013 93.508 8 4.108 95.844 
one 
pes? C3H4N20S C1=C(C(-0)NS1)N_ | *AmingTZthiazol |g o7g 58.885 | 7 6.145 90.013 
3-amino-1- 
DCS8 C6H12N20 CCN1CCC(C1=0)N diivinmratcareone | 2988 46.332 9 28.175 28.643 
1- 
Dcs9 C4H9N3O C1CN(C(=0)N1)CN (aminomethyl)imidaz |  -1.096 58.359 8 5.136 07.657 
olidin-2-one 
DCS10 C3H8N202 CC(C(=0)NO)N eamino-N. 1.692 75.349 7 04.109 96.716 
a hydroxypropanamide i : : : 
1-methyl-5- 
Desi C4H10N40 CNC1C(=0)NNNIC (methylamino)triazoli | -1.196 56.39 9 30.151 20.717 
din-4-one 
= a 4-benzyl-2-methyl- 
pcesi2 | C10H10N202S CNIC(=O)N(C(=O)S1)C | 4 5 4 thiadiazolidine- 1.155 44.01 15 222.269 88.228 
C2=CC=CC=C2 
3,5-dione 
pesi3 C6HENOS ccoci=c(c=csin | Fethoxyihiopnens | 4 565 35.257 | 9 43.211 28.391 
F[l(D-(3-nitro-6- 
CIC(C(=O)NO1)NC=C2 | *XCcycohexa-2.4- 
Dcst4 C10H9N305 Sec eaSGeO)NaIRO ddeneinettanih -0.165 113.254 | 18 251.198 201.922 
0]-1,2-oxazolidin-3- 
one 
2.5- 
DCS15 C5H10N20 C1CC(C(=0)C1N)N diaminocyclopentan- | -2.389 69.117 8 114.148 110.613 
1-one 
N-(2-acetyl-3-oxo- 
CC1=CC=C(C=C1)S(=O 1,2-oxazolidin-4-yl)- 
DCSi6 C12H14N2058 | )(=0)NC2CON(C2=0)C( 4- -0.026 92.783 | 20 298.32 243.523 
=0)C methylbenzenesulfo 
namide 
ej (4R)-4-amino-1 ,2- 7 
DCS17 C3H6N202 C1C(C(=0)NO1)N Se eieseee 1.898 64.355 7 02.093 87.082 
= (4S)-4-amino-1 ,2- 7 
DCSi8 C3H6N202 C1C(C(=0)NO1)N Cee 1.898 64.355 7 02.093 87.082 
ae = 4-ethyl-2-propyl- 
Desig C7H12N2028 samc ai 1,2,4-thiadiazolidine- 0.815 44.01 12.0 88.252 166.984 
3,5-dione 
: z 2,4-diethyl-3- 
DCS20 C6H10N20S2 sees ere sulfanylidene-1 ,2,4- 0.655 26.939 1 90.293 159.06 
thiadiazolidin-5-one 
id _ 4-butyl-2-ethyl-1,2,4- 
pcs21 C8H14N2028 pe or a = | thiadiazolidine-3,5- 1.374 44.01 13 202.279 183.786 
dione 
4-ethyl-2-propan-2- 
CCN1C(=0)N(SC1=0)C yl ,2,4- 
pcs22 C7H12N2028 oc oe eS: 0.675 44.01 12 188.252 166.769 
dione 
ethyl 2-(2-methyl- 
CCOC(=0)CN1C(=0)N( 3,5-dioxo-1,2,4- 
DCS23 C7H10N2048 SC1LO}C hoae tan ae -0.202 70.315 14 218.234 178.15 
yl)acetate 
7 7 3,6- 
pcs24 C6H12N404 C(CIC(=O)NC(C(=O)NT_ | bis aminooxymethyl) 2.9 128.71 14 204.186 172.371 
)CON)ON is(amir t 
piperazine-2,5-dione 
= = 4-benzyl-2-ethyl- 
pces25 | C11H12N2028 CONIC(=O)N(C(=O)S1) | 4 5 4 -thiadiazolidine- 1.531 44.01 16 236.296 205.03 


*Octanol/water partition coefficient 
» Topological polar surface area 


4Number of Rotatable Bonds 
°Molecular volume 


Maruthi Prasad Palthur et al. 


Antimycobacterial Drug Design: Homology Modeling and Docking Studies on Mycobacterium tuberculosis Alanine Racemase, 


Drug discovery, 2013, 3(8), 27-36, 


© The Author(s) 2013. Open Access. This article is licensed under a Creative Commons Attribution License 4.0 (CC BY 4.0) 
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FUTURE ISSUES 


1.Computational predicted data should be validated using suitable functional assays for further consideration. 

2.Pre-clinical analyses of designed analogues are necessary to accurately understand its molecular mechanism of action and pharmacological efficiency 
to conclusively state them as an anti-mycobacterial analogue 

3.Calculate ADME/T properties of the designed ligands using the ADME/T tools in future 

4.Further research is necessary to assess the lethal targets in the D-alanine branch of peptidoglycan biosynthesis in mycobacteria. This information is 
necessary for the development of new antimycobacterial agents targeting the D-alanine pathway. 


5. CONCLUSION 


The emergence of emergence of MDR-TB and XDR-TB 
makes the discovery of new molecular scaffolds a priority. 
Considering the urgency of the current situation which even 
necessitates the re-engineering and repositioning of some 
old drug families, drug discovery strategy for the incremental 
improvements of existing scaffolds is to fill a drug 
development pipeline is chosen for the purpose of this study. 
Due to its essential nature, coupled with the absence of a 
human homolog, an essential and uniquely prokaryotic 
enzyme Alr has long been an attractive drug target and has 
long been pursued as a target for antimycobacterial drug 
discovery. D-Cycloserine is a rigid analog of D-alanine is 
chosen as scaffold for the rational design of new D- 
cycloserine analog antimycobacterial drugs targeting 
enzymes of the D-alanine pathway of peptidoglycan 
biosynthesis. This study highlights the feasibility of obtaining 
novel Alr inhibitor lead compounds by incremental 
improvements of existing scaffolds is to fill a drug 
development pipeline. For the rational structure-based 
design of drugs, knowledge of the three-dimensional 
structure of the target protein is indeed inevitably required. 
Thus, referring to suitable reference structures, well spread 
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